
Search: drop use of TagCache, extract tags and tag values on-demand #1068

Merged: 30 commits, Dec 2, 2021

Conversation

@yvrhdn (Member) commented Oct 21, 2021

What this PR does:
Instead of storing tags and tag values in the TagCache, we can extract them directly from live traces, the WAL and local blocks when requested. This should lower the amount of work spent on updating the TagCache during ingest.

Since tag values are extracted on demand, we are no longer limited by the size of the tag cache. This makes it possible to return thousands of values when needed.
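To illustrate the idea, here is a minimal Go sketch (the interface and function names are hypothetical, not Tempo's actual API) of collecting distinct tag values on demand from several sources at query time instead of maintaining a cache during ingest:

```go
package main

import (
	"context"
	"fmt"
	"sort"
)

// tagValueSource abstracts anything that can enumerate values for a tag:
// live traces, the WAL or a completed local block. Hypothetical interface.
type tagValueSource interface {
	TagValues(ctx context.Context, tagName string) ([]string, error)
}

// memSource is a toy in-memory stand-in for one of those sources.
type memSource map[string][]string

func (m memSource) TagValues(_ context.Context, tagName string) ([]string, error) {
	return m[tagName], nil
}

// collectTagValues merges and deduplicates values for tagName across all
// sources at query time; nothing is cached during ingest.
func collectTagValues(ctx context.Context, tagName string, sources ...tagValueSource) ([]string, error) {
	distinct := map[string]struct{}{}
	for _, src := range sources {
		values, err := src.TagValues(ctx, tagName)
		if err != nil {
			return nil, err
		}
		for _, v := range values {
			distinct[v] = struct{}{}
		}
	}
	out := make([]string, 0, len(distinct))
	for v := range distinct {
		out = append(out, v)
	}
	sort.Strings(out)
	return out, nil
}

func main() {
	liveTraces := memSource{"service.name": {"frontend", "cart"}}
	wal := memSource{"service.name": {"cart", "checkout"}}

	values, _ := collectTagValues(context.Background(), "service.name", liveTraces, wal)
	fmt.Println(values) // [cart checkout frontend]
}
```

Because values are deduplicated across sources, a value that appears in both live traces and a flushed block is only returned once.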

Which issue(s) this PR fixes:
Fixes #

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@yvrhdn (Member, Author) commented Oct 21, 2021

I've been testing on my local setup, fetching tag values for load_generator.seq_num, a unique ID added to every trace by the synthetic-load-generator. Before these changes the number of values always got truncated at roughly 200. With this PR the number of values returned increases to about 80k and drops back to around 20k when a block is flushed.

Avg latency sits around 200-400 ms.

@yvrhdn marked this pull request as ready for review on October 21, 2021 17:17
@yvrhdn (Member, Author) commented Oct 22, 2021

TODO

  • Check if the context has been cancelled (screenshot attached; a minimal sketch of this check follows after this list)
  • Experiment with filtering tag values on service.name (next PR)
  • Most time is spent searching live traces; can we speed this up?
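As a rough sketch of the cancellation check in the first TODO item (hypothetical types, not Tempo's actual ingester code), the scan over live traces can test the request context on every iteration so an abandoned query stops early:

```go
package main

import (
	"context"
	"fmt"
	"time"
)

// liveTrace is a hypothetical stand-in for an in-memory trace held by the ingester.
type liveTrace struct {
	tags map[string][]string
}

// tagValuesFromLiveTraces scans live traces for values of tagName, checking
// the context on every iteration so a cancelled or timed-out request stops
// the scan promptly instead of burning CPU on an abandoned query.
func tagValuesFromLiveTraces(ctx context.Context, tagName string, traces []*liveTrace) ([]string, error) {
	distinct := map[string]struct{}{}
	for _, t := range traces {
		if err := ctx.Err(); err != nil {
			return nil, err // context.Canceled or context.DeadlineExceeded
		}
		for _, v := range t.tags[tagName] {
			distinct[v] = struct{}{}
		}
	}
	out := make([]string, 0, len(distinct))
	for v := range distinct {
		out = append(out, v)
	}
	return out, nil
}

func main() {
	ctx, cancel := context.WithTimeout(context.Background(), 50*time.Millisecond)
	defer cancel()

	traces := []*liveTrace{{tags: map[string][]string{"service.name": {"frontend"}}}}
	values, err := tagValuesFromLiveTraces(ctx, "service.name", traces)
	fmt.Println(values, err)
}
```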

@yvrhdn requested a review from mdisibio on October 26, 2021 23:07
@yvrhdn added this to the v1.3 milestone on Oct 29, 2021
@yvrhdn (Member, Author) commented Nov 8, 2021

Still to address: when fetching tag values with very high cardinality (for instance http.url), requests fail or time out when sent between components. The ingester is fine digging up all the values, but the gRPC connection between the ingester, querier and query-frontend hits the max message size...
We should add some limits (optionally configurable) to protect against this. Ideally, we return early once we have 'a lot of' values.

Future improvements (not for this PR): we can add filter capabilities to the tag values endpoint, so if someone starts typing http.url=/myService/ we only return values containing /myService/. This would make high-cardinality tags more usable.
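A minimal sketch of the kind of cap discussed above, using an assumed helper rather than Tempo's actual limit handling; the caller stops collecting once the cap is reached and can report that the result was truncated:

```go
package main

import "fmt"

// appendDistinctLimited adds values to the distinct set until the cap is
// reached and reports whether the result was truncated, so the caller can
// stop scanning further sources and return early.
func appendDistinctLimited(distinct map[string]struct{}, values []string, max int) (truncated bool) {
	for _, v := range values {
		if len(distinct) >= max {
			return true
		}
		distinct[v] = struct{}{}
	}
	return false
}

func main() {
	distinct := map[string]struct{}{}

	// A tiny cap of 2 for illustration; a real limit would be large and
	// possibly configurable.
	truncated := appendDistinctLimited(distinct, []string{"a", "b", "c"}, 2)
	fmt.Println(len(distinct), truncated) // prints: 2 true
}
```

Surfacing the truncated flag to the caller would also let the API signal that the returned value list is partial.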

@joe-elliott (Member) left a comment:

Remove the TagCache struct?

@yvrhdn (Member, Author) commented Nov 8, 2021

Remove the TagCache struct?

Yep 👍, I've removed tempodb/search/tag_cache.go; it's all the way at the bottom of the diff:
https://github.com/grafana/tempo/pull/1068/files#diff-b29b479cf004a8bf08fb55114702b94f6d818911fca0c83c985e771db2fbe4bb

@annanay25 requested a review from joe-elliott on December 1, 2021 13:40
Signed-off-by: Annanay <[email protected]>
@mdisibio (Contributor) left a comment:

Thanks for getting the limit check in. Good to merge.

4 participants